Picture for Zichen Liu

Zichen Liu

Rethinking the Trust Region in LLM Reinforcement Learning

Add code
Feb 04, 2026
Viaarxiv icon

A DVL Aided Loosely Coupled Inertial Navigation Strategy for AUVs with Attitude Error Modeling and Variance Propagation

Add code
Jan 27, 2026
Viaarxiv icon

RollArt: Scaling Agentic RL Training via Disaggregated Infrastructure

Add code
Dec 27, 2025
Viaarxiv icon

Defeating the Training-Inference Mismatch via FP16

Add code
Oct 30, 2025
Viaarxiv icon

BinCtx: Multi-Modal Representation Learning for Robust Android App Behavior Detection

Add code
Oct 16, 2025
Viaarxiv icon

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Add code
Sep 26, 2025
Figure 1 for Language Models Can Learn from Verbal Feedback Without Scalar Rewards
Figure 2 for Language Models Can Learn from Verbal Feedback Without Scalar Rewards
Figure 3 for Language Models Can Learn from Verbal Feedback Without Scalar Rewards
Figure 4 for Language Models Can Learn from Verbal Feedback Without Scalar Rewards
Viaarxiv icon

Variational Reasoning for Language Models

Add code
Sep 26, 2025
Figure 1 for Variational Reasoning for Language Models
Figure 2 for Variational Reasoning for Language Models
Figure 3 for Variational Reasoning for Language Models
Figure 4 for Variational Reasoning for Language Models
Viaarxiv icon

Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning

Add code
Jun 08, 2025
Viaarxiv icon

Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library

Add code
Jun 06, 2025
Viaarxiv icon

Reinforcing General Reasoning without Verifiers

Add code
May 27, 2025
Viaarxiv icon